Index Support for SPARQL
نویسندگان
چکیده
The Resource Description Framework (RDF) is the fundamental data model underlying the Semantic Web. Recently, SPARQL has been proposed as W3C standard for querying RDF repositories. As RDF is a graph-based data model, the core problem of evaluating SPARQL queries is matching query graph patterns against the data graph, which is computationally very expensive. We address this problem by indexing graph patterns. In the spirit of SQL, we assume that users select the patterns to be indexed. We formally define the problem of covering indexes, i.e., finding those indexes whose graph-patterns are contained in the query pattern, and derive formulas for estimating index selectivity. Finally, we study the problem of finding optimal sets of indexes for a given query. We believe that our framework is the first comprehensive suggestion for indexing RDF for SPARQL queries that goes beyond simple indexing of labels.
منابع مشابه
SPACE: SPARQL Index for Efficient Autocompletion
Querying Linked Data means to pose queries on various data sources without information about the data and the schema of the data. This demo shows SPACE, a tool to support autocompletion for SPARQL queries. It takes as input SPARQL query logs and builds an index structure for efficient and fast computation of query suggestions. To demonstrate SPACE, we use available query logs from the USEWOD Da...
متن کاملSAMUEL: A Sharing-based Approach to processing Multiple SPARQL Queries with MapReduce
The volume of RDF data is now growing tremendously. It is thus considered prudent to store and process massive RDF data with distributed SPARQL engines instead of relying on a singlemachine system.Many sophisticated index and partitioning schemes have also been proposed to support SPARQL query evaluations. However, existing SPARQL engines have mainly followed oneat-a-time scheme so that query e...
متن کاملUsing an index of precomputed joins in order to speed up SPARQL processing
SparQL is a query language developed by the W3C, the purpose of which is to query a data set in RDF representing a directed graph. Many free available or commercial products already support SparQL processing. Current index-based optimizations integrated in these products typically construct indices on the subject, predicate and object of an RDF triple, which is a single datum of the RDF data, i...
متن کاملTowards Efficient SPARQL Query Processing on RDF Data
Efficient support for querying large-scale RDF triples plays an important role in Semantic Web data management. This paper proposes an efficient RDF query engine to evaluate SPARQL queries, where the inverted index structure is employed for indexing RDF triples. We first design and implement a set of operators on the inverted index for query optimization and evaluation. Then we propose a main-t...
متن کاملSubstring Filtering for Low-Cost Linked Data Interfaces
Recently, Triple Pattern Fragments (tpfs) were introduced as a low-cost server-side interface when high numbers of clients need to evaluate sparql queries. Scalability is achieved by moving part of the query execution to the client, at the cost of elevated query times. Since the tpf interface purposely does not support complex constructs such as sparql filters, queries that use them need to be ...
متن کامل